Light propagation in statistically homogeneous and 
isotropic dust universes 



Syksy Rasanen 

Universite de Geneve, Departemeiit do Physique Theorique 
24 quai Ernest- Anscrmct, CH-1211 Geneve 4, Switzerland 

E-mail: syksy dot rasanen at iki dot fi 
Abstract. 

We derive the redshift and the angular diameter distance in rotationless dust 
universes which are statistically homogeneous and isotropic, but have otherwise 
arbitrary geometry. The calculation from first principles shows that the Dyer-Roeder 
approximation does not correctly describe the effect of clumping. Instead, the redshift 
and the distance are determined by the average expansion rate, the matter density 
today and the null geodesic shear. In particular, the position of the CMB peaks 
is consistent with significant spatial curvature provided the expansion history is 
sufBciently close to the spatially flat ACDM model. 
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1. Introduction 

Observations and dumpiness. Observations of the universe are inconsistent with 
homogeneous and isotropic cosmological models with ordinary matter and standard 
gravity (meaning matter with non-negative pressure and gravity that is described by the 
four- dimensional Einstein-Hilbert action). Observational results are usually expressed in 
terms of their interpretation in the context of the homogeneous and isotropic Friedmann- 
Robert son- Walker (FRW) models, in which there is a one-to-one relationship between 
the expansion rate and the distance (given the spatial curvature). Analysed this way, 
observations imply that the expansion has accelerated in the past few billion years [1,2]. 
A model-independent statement would be that the observed distances at late times are 
a factor of about 2 higher than expected in FRW models with ordinary matter and 
gravity. This is usually taken as evidence for exotic matter with negative pressure or 
a modification of gravity. In particular, the homogeneous and isotropic, spatially flat 
model ACDM model flts observations of the distance scale and the expansion rate well 
by introducing vacuum energy. (We will refer to all homogeneous and isotropic models 
which contain only dust and vacuum energy as ACDM, whatever the spatial curvature.) 
However, the universe is known to be far from exact homogeneity and isotropy at late 
times due to the formation of non-linear structures. Before concluding that new physics 
is needed, it is necessary to quantify the effect of dumpiness on the observations. 

The influence of inhomogeneity and/or anisotropy on the average evolution was flrst 
mentioned in [3] and was discussed in detail in [4] under the name "fltting problem" . The 
effect on the expansion rate is also known as backreaction [5-7]; see [8-10] for reviews. 
The possibility that backreaction could lead to accelerated expansion and explain the 
observations without new physics was flrst considered in [11,12] (and briefly mentioned 
in [13, 14]). Both metric and matter perturbations were taken into account and the 
observables were correctly identifled in [15], where flrst order perturbation theory was 
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expanded to second order, and this was extended to a consistent second order calculation 
in [16]. In [9, 17, 18] it was explained with toy models that the physical reason for 
average acceleration is that faster expanding regions come to occupy a larger fraction 
of the volume. Accelerated expansion has also been explicitly demonstrated with the 
exact Lemaitre-Tolman-Bondi (LTB) solution [17,19,20]. A semi-realistic model with 
an evolving distribution of non-linear structures was studied in [21,22], and it was found 
that the expansion rate rises (relative to the homogeneous and isotropic case) by the 
right order of magnitude around the right time, some billions to tens of billions of years, 
though not rapidly enough to correspond to acceleration. The model involved several 
approximations, and a more careful treatment would be needed for detailed comparison 
with observations. 

However, most observations, including those of the cosmic microwave background 
(CMB) [23] and type la supernovae [24], do not directly measure the expansion rate, 
but rather cosmological distances and redshifts, which are defined in terms of light 
propagation. The few measurements that are sensitive to the expansion rate independent 
of the distance scale are those of the local Hubble parameter [25, 26], the ages of 
passively evolving galaxies as a function of redshift [27], the Integrated Sachs- Wolfe 
(ISW) effect [28,29] and the growth rate of matter fluctuations [30]. Baryon acoustic 
oscillations depend on a mixture of the expansion rate and distance [31,32]. 

The distance and the expansion rate. In addition to changing the expansion of the 
universe, non-linear structures affect the relationship between the expansion rate and 
light propagation. In a general spacetime, there is no direct relationship between the 
expansion rate and the distance scale, and it would in principle be possible to explain 
the observations without accelerated expansion. For example, in models where we are 
located near the center of an untypically large spherical void, the distances can be 
consistent with the observations, but generally there is no acceleration [33,34]. (See [35] 
for a review, [9] for more references, and [36-41] for observational constraints related 
to the inhomogeneity of these models.) Even if the large local void models are not 
realistic, studying them with the exact LTB solution has established unambiguously 
that inhomogeneities with density contrast of order unity and sizes smaller than the 
horizon can have a large impact on the distance scale. This is in contrast to perturbation 
theory arguments based on the amplitude of metric perturbations in the longitudinal 
gauge. 

The speculative possibility of an unexpectedly large local void aside, the observed 
universe seems to be statistically homogeneous and isotropic on large scales, with a 
homogeneity scale of around 100 Mpc [42,43] (though see [44,45]). (For discussion of 
statistically homogeneous and isotropic but locally clumpy dust universes, see [9,21].) 
The relevant question is then what is the effect of such a distribution of non-linear 
structures on light propagation. 

It was conjectured in [21] that in a statistically homogeneous and isotropic dust 
universe, light propagation can be treated in terms of the overall geometry (meaning the 
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average expansion rate and average spatial curvature) if the structures are realistically 
small and the observer is not in a special location. Such a conjecture is in agreement 
with various studies of light propagation (see [21] for an overview and references), and 
it is also suggested by the fact that different observations are well explained in terms 
of the evolution of a single scale factor. However, until now there had been no proof 
of the conjecture, and it was not known whether additional conditions are necessary in 
addition to statistical homogeneity and isotropy. 

In the present work, we clarify the relationship between the expansion rate and 
the distance scale in statistically homogeneous and isotropic dust universes which may 
contain non-linear structures. We relate the redshift and the distance to the dust 
geometry, and confirm that light propagation can be expressed in terms of average 
geometrical quantities, up to a term related to the null geodesic shear. In fact, the 
null shear aside, the average expansion rate (and the matter density today) is sufficient 
to determine the distance, and the spatial curvature enters only via its effect on the 
expansion rate. This implies that a clumpy model can be consistent with the observed 
position of the CMB acoustic peaks even when there is significant spatial curvature. The 
result also shows that the Dyer-Roeder prescription of multiplying the matter density 
by a smoothness factor does not correctly describe the effect of clumping. 

In section 2 we set up the dust geometry, in section 3 we relate the redshift and 
the distance to the average geometry, and in section 4 we discuss and summarise our 
results. 



2. The dust geometry 
2.1. The local equations 

The gradient decomposition. We are interested in light propagation in a dust spacetime. 
The geometry is a solution to the Einstein equation with dust matter, 

= MG^pU^Up , (1) 

where Rap is the Ricci tensor, R is the Ricci scalar, is Newton's constant, T^p is 
the energy-momentum tensor, p is the dust energy density and u°' is the velocity of 
observers comoving with the dust. 

The velocity has unit norm, UqU" = —1. Since the pressure is zero, is the tangent 
vector of timelike geodesies, u^V/ju" = 0. We can define a tensor which projects on the 
tangent space orthogonal to by 

Kl3 = gal3 + UaUfj , (2) 

where g^p is the full metric. The three- metric h^p satisfies hapu^ = 0, h^h^p = hap, 
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It is useful to express the geometry in terms of the decomposition of the covariant 
derivative of (for reviews of this covariant approach, see [46-50]): 

= -hafi() + (^afi + ^aP , (3) 

where the symmetric part 9ap = V^jsUa) is the expansion tensor and the antisymmetric 
part Uais = V[/3Mq,] is the vorticity tensor. The trace of the expansion tensor 9 = VcM" 
is the volume expansion rate and the trace-free part aa/3 = V(q,m^) — ^ha/sO is the 
shear tensor. Like h^is, the tensors a^is and are spatial in the sense that they are 
orthogonal to u°', Uapu^ = Capu^ = 0. They are also traceless, u"^ = a\ = 0. 

The scalar equations. The Einstein equation (1) can be conveniently written in terms 
of the decomposition (3) and the electric and magnetic parts of the Weyl tensor. For the 
full system of equations, see [48] (page 27). We are interested in the overall geometry, in 
other words in average quantities. Since only scalars can be straightforwardly averaged 
in a curved spacetime (though see [51-53]), we will consider only the scalar part of the 
Einstein equation, which reads 

e + -e'^ = - AttG^p - 2a^ + 2cj2 (4) 
3 

V =87rG^p-^^'^R + a^-u;'' (5) 

P + Op =0, (6) 

where a dot stands for dt = u°''Va, the covariant derivative with respect to proper time 
t measured by observers comoving with the dust, cx^ = ^Ca/Bcr"^ > is the shear scalar, 
u'^ = ^uJa/3^^°'^ > is the vorticity scalar, and ^^^R is the Ricci scalar on the tangent 
space orthogonal to The acceleration equation (4) is known as the Raychaudhuri 
equation, and (5) is the Hamiltonian constraint. 



2.2. The average equations 

Defining the average. If and only if the vorticity is zero, the tangent spaces orthogonal 
to form spatial hypersurfaces, and provide a foliation that fills the spacetime exactly 
once. These flow-orthogonal hypersurfaces coincide with the hypersurfaces of constant 
proper time of comoving observers. If the vorticity is non-zero, the hypersurfaces of 
constant proper time are no longer orthogonal to the fluid flow [46,47]. 

We assume in this subsection that the vorticity is zero, and follow the formalism 
of [6,7]. The spatial average of a quantity is then its integral over the hypersurface of 
constant proper time t orthogonal to u", divided by the volume of the hypersurface, 

ifm = 4^ , (7) 

where e^/j^ = rja/s-ysu^ is the volume element on the tangent space orthogonal to u"", 
Va/3-yS being the spacetime volume element. 
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In particular, the average expansion rate is 



t 

. 3^ . (8) 

where we have defined the scale factor a{t) as the volume of the hypersurface of constant 
proper time to power 1/3, 

.W.(li)', (9) 

and a has been normalised to unity at time to, which we take to be today. We will also 
use the notation H = a/a. 

The Buchert equations. Let us take the average of the equations (4)-(6). The resulting 
Buchert equations are [6] 

3- = -A'kG^{p) + Q (10) 
a 

3^ = 8vrGN(p)-^((=^)i?)-^Q (11) 

9,(p) + 3-(p) = 0, (12) 
a 

where the backreaction variable Q contains the effect of inhomogeneity and anisotropy: 

Q^\m-{eY)-2{a') . (13) 

The integrability condition between (10) and (11) reads 

d^ii^)R) + 2-((3)i?) = -d^Q - 6-Q . (14) 
a a 

The Buchert equations (10)-(12) describe the evolution of the volume a^ of a spatial 
domain, or equivalently its average expansion rate (9). They differ from the FRW 
equations by the presence of the backreaction variable Q, and the related fact that 
the average spatial curvature {^^^R) can evolve in a non-trivial manner, as indicated by 
the integrability condition (14), whereas in FRW universes it is always proportional to 

[54] (page 720), [55]. If the backreaction variable Q is large enough, the expansion 
will accelerate, as indicated by (10), and the spatial curvature will be correspondingly 
large, as indicated by (11) and (14) [9,56]. 



3. The light propagation 

3.1. The redshift 

Geometrical optics. When the wavelength of light is much shorter than both the 
local curvature radius and the typical scale over which the amplitude and the 
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wavelength change appreciably, light propagation can be treated in the geometrical 
optics approximation [54] (page 570), [57] (page 93). In geometrical optics, light travels 
along null geodesies, and the light rays have no effect on the geometry. The tangent 
vector of the null geodesies is given by the gradient of the wave phase 5", and it is 
identified with the photon momentum, ka = daS. The null geodesic tangent vector 
satisfies k^k"" = and fc^V^A;^ = 0. 

We will consider the propagation of a bundle of nearby null geodesies. We are 
interested in the redshift and the surface area of the bundle, the latter of which gives 
the angular diameter distance. We will not consider caustics, which are not expected 
to be important for typical light rays in cosmology (though see [58]). 

The general expression for the redshift. The spacetime geometry is determined 
dynamically by the Einstein equation (1) and traced by the dust geodesies with tangent 
vector m". Light propagation involves a derived geometrical structure, given by the 
solution of the null geodesic equations in the fixed spacetime geometry, traced by the 
photon geodesies with tangent vector /c". The tangent vectors u'^ and k'^ are parallel 
propagated with respect to the dust and photon geodesies, respectively, but not with 
respect to each other. It follows that the photon momentum changes along the dust 
geodesies. The redshift 2 of a source is defined as the observed photon wavelength 
divided by the wavelength at the source, minus one. The wavelength is inversely 
proportional to the energy E, so 

1 + ^=1^, (15) 

where s refers to the source and o to the observer. The energy is the projection of the 
momentum onto the observer's velocity, given by the tangent vector of the dust geodesic. 



It is convenient to decompose fc" into an amplitude and the direction, and split the 
direction into components orthogonal and parallel to the dust geodesies. 



where Uac" = 0, Cac" = 1. The vector is spatial in the sense that it lies in the 
three-space which has the metric h^ij, h'^^e.^ = e". 

To find out how the energy evolves along the null geodesic, we take the derivative 
with respect to the affine parameter A, 



E = —Uryk' 



(16) 



fc" = E(m" + e°) , 



(17) 



k^kf^WaUp 




(18) 
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where we have apphed the decompositions (3) and (17) and taken into account 
k°''Vak^ = 0. Note that the vorticity tensor drops out due to its asymmetry. We 
can integrate (18) to obtain E oc exp (— J d\E 9a/3e°'e'^) (the factor E is retained in the 
integrand for later convenience). Using (15) we then have for the redshift 



1 + z = exp 



exp 



dXEOaf^e' 



dXE 



3 



(19) 



where the integral is from the source to the observer along a specific geodesic. The above 
relation gives the redshift in a general dust spacetime in terms of the dust geometry 
and the spatial direction of the null geodesies. (Vorticity enters indirectly via the 
geodesic equation which determines e".) 

Looking at the redshift from the viewpoint of observers on dust geodesies, is 
constant, and the momentum decreases. However, following the null geodesies instead 
makes the relation to the change in the dust geometry more transparent. Along a null 
geodesic the momentum k" is constant, and the product —Uak'^ changes because Ua 
is turning along the photon path, which is precisely what VpUa quantifies. (Following 
the dust geodesies by taking the derivative dt = m^Vq instead of dx would lead to an 
expression for the redshift in terms of the turning of the null geodesies.) 

Let us assume that the vorticity vanishes. Then the hypersurfaces of constant 
proper time are orthogonal to m", and t{X) is monotonic. We can therefore parametrise 
points along the geodesic with the value of the proper time t on the hypersurface 
that the null geodesic is crossing^. We can then invert dx = E{u" + e")9a to obtain 
J dA = J^dtE"^, so we have 



1 + z = exp 



to 



dt 



t.x 



-e{t, x{t)) + aaf^it, a;(t))e"(t, x{t))e^{t, x{t)) 



(20) 



where the subscript A indicates that the integral is from the source to the observer along a 
specific geodesic, which crosses the hypersurface of proper time t at spatial position x{t). 



Statistical homogeneity and isotropy. In a FRW universe, the shear is zero, and only 
the expansion rate remains in the integral (20). Since the universe is homogeneous 
and isotropic, it does not matter in which direction the geodesic goes, and the result 
1 + z = a(t)~^ immediately follows from the definition of the scale factor (9). 

In a general dust universe, the shear can be important, and the result depends on 
the direction of the null geodesic. In [57] (page 136), it was assumed that the average 
of Oaf^e'^e^ along the ray reduces to the FRW expansion rate, and that this may be 
considered as part of the incomplete definition of a model being "on average FRW" . We 



:|: When rotation is present, it is not obvious that a light ray could not pass from a higher value of t to 
a smaller value. Note that light propagating from an observer with a larger proper time to one with a 
smaller proper time does not necessarily violate causality. 
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will try to make this reasoning somewhat more explicit, and take into account that the 
average expansion rate does not necessarily reduce to the FRW one. 

In a general dust spacetime, there is no direct relationship between the redshift 
(20) and the scale factor (9). The redshift is given by the integral of the local expansion 
rate and projected shear along a null geodesic, while the scale factor is given by the 
time integral of the spatially averaged expansion rate. However, in a statistically 
homogeneous and isotropic universe, the two quantities are closely related. If structures 
are identical in all directions up to statistical fluctuations, and if the coherence scale of 
the distribution (related to the homogeneity scale) is much smaller than the distance 
over which we consider light propagation, then the redshift should be the same in all 
directions, and it should not depend on the specific geodesic we are looking at. We can 
view this in the following manner. 

The dust shear aa/3 is related to the structures through which the light travels. If 
the structures have no preferred orientation, the shear is uncorrelated with the direction 
of the null geodesic, and the integral of the shear projected on the geodesic should vanish 
over long distances. In other words, the integral of 9ape°'e^ should be the same for all 
e", which implies that only the trace contributes. Furthermore, the integral of the local 
expansion rate along the null geodesic can be related to the integral of the average 
expansion rate over time. Neglecting the shear, we split the expansion rate at each 
point along the geodesic into the average value on the hypersurface of constant proper 
time at that point (given by (8)) and the variation. 



where AO = 6{t, x{t)) — {0){t). (In what follows, we will use the notation / = (/) + A/ 
for any scalar quantity /; there is no assumption that A/ is small, and hence no loss 
of generality.) If the distribution of structures evolves slowly compared to the time it 
takes for light to cross the homogeneity scale (i.e. for the null geodesic to integrate 
over a statistically homogeneous and isotropic sample), then the contribution of the 
variation A^ should be small compared to the contribution of the average (6). Consider 
two geodesies passing from the hypersurface with proper time t to one with proper 
time t + At. If the geodesies cross the hypersurfaces at different points, they will go 
through different structures on the way and will in general gain different amounts of 
redshift. However, if the structures evolve slowly compared to the passage time At, the 
distribution of structures is essentially static between t and t + At. If the distance At 
is at least as large as the homogeneity scale, the redshift gained is the same for both 
geodesies, up to statistical fluctuations. 

In the real universe, the size of structures and the homogeneity scale are indeed 
small compared to the Hubble time, which is the timescale of the evolution of the 
distribution of structures. For typical supersymmetric weakly interacting dark matter 
candidates, the size of the first structures, which form around z ~ 40-60, is of the order 
10~^H~^ [59]. The size of structures relative to the Hubble length grows as structure 
formation proceeds, and today the largest typical structures have sizes of around 10 Mpc 




(21) 
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^ 10^^ H'^. The homogeneity scale today is of the order of 100 Mpc ^ 10~'^H~^ [42,43] 
(though see [44,45]). 

If the contribution of AO in (21) can be neglected, the redshift does not depend on 
the specific geodesic, and we have 

l + z^l + {z) = a{t)-\ (22) 

where a{t) is the scale factor defined in (9). The average redshift on the hypersurface 
of proper time (z) can by statistical homogeneity and isotropy also be understood as 
the average along a specific geodesic taken over a distance longer than the homogeneity 
scale (but much shorter than the Hubble scale). In a statistically homogeneous and 
isotropic universe, the redshift is independent of direction, and it is related to the 
volume expansion rate in the same way as in the exactly homogeneous and isotropic 
FRW models. The change in the wavelength of a typical photon is, over long distances, 
only determined by the overall expansion of the volume. (This conclusion disagrees 
with [60,61], where the redshift was considered using ad hoc treatments for the spatial 
curvature; see also [55].) 

While the relation (22) between the expansion rate and the redshift is simple, the 
result is not entirely obvious. For example, it is vital that the shear and the expansion 
rate appear linearly in the redshift integral (20). In comparison, the shear and the 
expansion rate contribute quadratically to the dust equations of motion (4) and (5), 
so the shear does not drop out when considering large regions, and the variance of the 
expansion rate plays an important role. 

The parametrisation of the null geodesies in terms of the proper time is crucial. 
The evolution of structures is governed by proper time, so the hypersurface of proper 
time is also the hypersurface of statistical homogeneity and isotropy. It is the statistical 
homogeneity and isotropy which makes it possible to neglect the shear and A^, and relate 
the scale factor (9) to observables. This expresses in more detail the relation between 
the redshift, the scale factor and statistical homogeneity and isotropy discussed in [9,21]. 
For making this argument sharper, it would be worthwhile to have a more precise notion 
of statistical homogeneity and isotropy in a general dust spacetime. 

This connection with the observable redshift establishes the observational relevance 
of the expansion rate averaged over the hypersurface of constant proper time. Studying 
the average expansion rate has been criticised [62] on the grounds that it depends on the 
hypersurface of averaging [63,64], and also because the average is taken on a spacelike 
hypersurface, while observations are made along and inside the past light cone. Though 
there is a preferred foliation for dust, given by the hypersurface of constant proper time 
of comoving observers [9,21,65], ultimately the usefulness of the expansion rate averaged 
on that hypersurface is determined by relating it to observed quantities, which we have 
now done for the redshift. 

The expansion rate and the shear can have large variations along the null geodesic, 
of the order of the average values. A crucial reason for why it is sufficient to consider 
only the averages is that the coherence length of the variations is much smaller than 
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the scale over which the averages change significantly. (The large amplitude of local 
variations in the expansion rate is clear from (4)-(6) and the fact that there are large 
differences in the local density. This can be seen explicitly in the spherical collapse 
model and its underdense equivalent [66,67].) 

Deviations from the mean. In a universe which is only statistically and not exactly 
homogeneous and isotropic, the contribution of the shear to the redshift integral (20) 
is not zero, only suppressed compared to the contribution of the expansion rate. The 
fact that the null geodesies are affected by the dust shear also leads to a correlation 
between the shear and the spatial direction of the null geodesic e°. However, from 
observations it is known that the change in e'^ for typical light rays is small, at the 
percent level [68]. Similarly, because there are statistical fluctuations and because the 
distribution is not exactly static, the integral of the spatial variation of the expansion 
rate will not completely vanish even over scales much longer than the homogeneity 
scale, though its contribution to (21) will be small compared to the contribution of the 
average expansion rate. In perturbed FRW models, the contribution of the shear and 
AO reduces, in addition to the local dipole term, to the ISW effect and the Rees-Sciama 
effect. (For covariant treatment of CMB perturbation theory, see [69]. The covariant 
formalism was also recently applied to the CMB in [70].) 

Another source of variation is the fact that the cancellations discussed above 
only happen when integrating over long distances. For distances smaller than the 
homogeneity scale, directional variation in the redshift due to differences in the local 
expansion rate and shear should be expected. Typical variation of the expansion rate 
in different directions within 70 Mpc around our location was found to be 20% in [71] 
(see also [72]). For the nearby supernovae, these variations are well known under the 
name peculiar velocities. (In the comoving approach we follow, peculiar velocities are 
zero by deflnition. For a covariant way of deflning peculiar velocities, see [73].) Because 
the homogeneity scale is small compared to the distance over which most cosmological 
observations are made, the effect of these deviations is expected to be small. 

A rough estimate of the contribution of the local variation in the geometry to the 
redshift (20) would be J dtA6 ^ L{A6), where L is the size of the region where there 
is signiflcant uncancelled variation, and {A6) is the typical magnitude of the variation. 
For L = 70 Mpc and |(A^)| = 0.2H(), where Hq is the Hubble parameter today, we 
get 10"'^. This may be an overestimate, because in the linear regime perturbations in 
the expansion rate and the shear mostly cancel, apart from the dipole (for which the 
contribution of local structures is indeed observed to be 10~^). There does not appear 
to be a reason for such a cancellation in the non-linear regime, but the importance of 
the deviation from the linear results for the local universe remains to be determined§. 

§ In the average expansion rate, the deviation from FRW behaviour does not become significant until 
fourth order, because of a similar cancellation between the local expansion rate and the shear at second 
order. This can be understood in terms of the Newtonian limit [9,74]. It is not clear whether the 
cancellation in the redshift can be understood in a similar manner. 
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In any case, the correction is negligible for cosmological probes other than the CMB. 
For the CMB, the effect might give an important contribution at large angles (where the 
variation in the expansion rate is large), and could possibly explain some of the observed 
anomalies [75,76] (see [9] for discussion and more references), particularly since some of 
them are correlated with the dipole. 

Observationally, we know from the CMB that the redshift of the last scattering 
surface is the same in all directions, i.e. for different geodesies, to the level 10~^, or 
10"^ if we exclude the dipole. Assuming that our location is typical in space, this is the 
variation over the hypersurface of constant proper time. The variation is likely be at 
least as small at earlier times, because structure formation is less advanced. In addition, 
deviation of the CMB spectrum from the blackbody shape is observed to be small [77]. 
The sum of two blackbody spectra at different temperatures is not a blackbody, so finite 
angular resolution inevitably changes the spectral shape when different directions have 
different CMB temperatures [78]. Scattering of the CMB leads to a similar effect [79] 
(this was used to constrain local void models in [36]). The limit on the ^/-distortion 
relevant for both cases is \y\ < 1.5 x 10"^ [77]. So the cancellations discussed above are 
well realised in the universe. 

Inapplicability of the 'almost EGS theorem'. There is an 'almost Ehlers-Geren-Sachs 
theorem', which states that if the observed CMB anisotropy is everywhere small, then 
the universe is close to FRW [80] (see also [81]). This is contrary to our conclusion 
above. The crucial assumption in the proof of the theorem which is not satisfied in 
the real universe is that magnitude of the spatial and time derivatives of the CMB 
anisotropy AT/(T) is at most |AT|/ (T) times the expansion rate||. Here T oc E is the 
CMB temperature, and the average on the hypersurface of constant proper time (T) 
can via statistical homogeneity and isotropy be also understood as the average over the 
sky measured at one position. 

As we have seen, it is true that \AE\ <^ (E) at every point, but this does not imply 
that the derivatives of AE would be smaller than the derivatives of E. Since the local 
variation in the expansion rate is of the order of the average expansion rate, |A6'| ~ (6), 
it follows from (18) that the derivative (m" + e")da{AE) is of the order \6\{E), rather 
than |6'||A£'|. Stated the other way round, large derivatives of the CMB temperature 
anisotropies do not imply a large variation in the CMB temperature between nearby 
points. The temperature difference is the integral of the derivative over some length, and 
as long as the spatial derivative remains large only over length scales small compared 
to the inverse derivative, the difference is small. The small correlation length of spatial 
variations is at the heart of the argument which leads from the general expression for 
the redshift (20) to the simple relation (22). 

11 It has been earlier pointed out [21] that the theorem is not valid in the real universe, because it also 
indicates that the gradient of the local matter density is at most the density divided by the Hubble 
length, times the CMB anisotropy of (neglecting the dipole) 10"^. However, the reason why the theorem 
fails was not identified. 
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3.2. The distance 

The two-metric. Cosmological distances are defined in terms of observations of light. 
Two commonly used distance measures are the angular diameter distance Da, which 
measures apparent size, and the luminosity distance D^, which measures apparent 
brightness. These distance measures are in a general spacetime related to each other 
by the reciprocity relation = (1 + z^D^, so there is only one independent distance 
[47], [57] (page 111), [82]. Other distances can be defined by multiplying with different 
powers of 1 + 2;; for discussion of different distance measures in the context of FRW 
models, see [2,83]. 

It is convenient to analyse the null photon geodesies in terms of the decomposition 
of the covariant derivative of the tangent vector, like we did with the timelike dust 
geodesies. In the case of timelike geodesies, the tensor ha/s defined in (2) provides a 
natural three-metric orthogonal to the flow. For null geodesies, the situation is more 
involved, because it is not possible to construct a metric orthogonal to k°' using only gaf^ 
and fc": a new vector field is needed. There is no unique choice, and different vectors 
are used in the literature [57] (page 106), [84,85]. For example, we can use the timehke 
vector m", which is already defined. It is straightforward to identify the observed area of 
a source as the projection onto the two-space orthogonal to both k°' and the observer's 
velocity u". This corresponds to the two-metric 

hap = gap - E~'^kakf3 + E'^Uakp + E'^kaUp 

= QaP + U^Up - CaCfS , (23) 

where we have applied the decomposition (17) on the second line. The expression 
in terms of e° is particularly transparent: the two- metric hap spans the subspace of 
the three-space orthogonal to that is also orthogonal to the spatial direction of 
the null geodesic e°. The two- metric (23) satisfies h^pu^ = h^pe^ = 0, h^pk^ = 0, 
hah-yp = hap, h\ = 2. While conceptually clear, the choice (23) is not the most 
convenient for practical calculations, because is not parallel propagated along the 
null geodesic. However, the area is independent of the choice of two-metric [84]. We will 
mostly keep the two-metric completely general and use only the condition h^pk^ = 0. 

The angular diameter distance. Given any two-metric orthogonal to k°', we can 
decompose the covariant derivative of k°' as follows, 

V/j/Cq = 6ap 

= ^hapO + dap + k^aPp) , (24) 

where the trace 9 = h^^Vak'^ = Vak°' is the area expansion rate, dap = h^h^^V-yks — 
\hapd is the shear and Pa is a vector which depends on the choice of hap and plays no 
role in what follows. We have dapk^ = 0, Pak°' = 0. The vorticity is automatically 
zero (unlike for u"), because k°' is the gradient of a scalar. Also in contrast to the 
decomposition of m", the shear is not the symmetric trace-free part of the full Vpka, 
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but rather the symmetric trace-free part of Vf^k^ projected onto the two-space with the 
metric hap- Thus the shear, unhke the area expansion rate, depends on the choice of 
two-metric. 

Denoting the local scale factor which describes the linear size of the null geodesic 
bundle two-space by s{t, x), the area expansion rate is ^ = 2d\s/ s. The angular diameter 
distance is proportional to the linear size. Da oc s (see for example [85]), so 

Da oc exp Q j dX9^ . (25) 

As noted, the distance is independent of the choice of the two-metric /i^^ (in particular, 
it does not depend on the observer's velocity u""). 

Evolution of the angular diameter distance. To determine how the angular diameter 
distance changes along the null geodesic, we need the evolution equation of 6. As in the 
case of the redshift, we take a derivative with respect to A, 

= k^'K/^k-, + k^^VpV^k" 
= - Raf^k'^k^ -Vfsk^'Vak^ 

= - 2^2 - ^9' , (26) 

where Rap-yS is the Riemann tensor, and we have used the condition k'^'Vak^ = and 
the decomposition (24). On the next to last line, we have used the Einstein equation 
(1). We have defined = icr^^a'^^ > and /i^ = AiiGt^pE'^ + a'^ > 0. Note that a^, 
unlike da/s, is independent of the choice of ha/3- The Raychaudhuri equation (26) for the 
null geodesies is analogous to the Raychaudhuri equation (4) for the timelike geodesies. 

Given the relation (25) and the equation (26), we obtain the equation satisfied by 
the angular diameter distance: 

OIDa = - {AnG^pE^ + d^)DA 

= - P^Da . (27) 

The right-hand side is non-positive (and the initial condition for SxDa is negative), 
so the distance is monotonic along the null geodesic. We also see that the dust energy 
density and the photon shear can only make distances smaller (i.e. objects appear larger 
and brighter) compared to the non-sheared vacuum case. In particular, neglecting the 
null shear would give an upper bound on the distance. 

We need the evolution equation for aap, or at least for a"^. The equation for 
will be different for different choices of ha(3, since the components o"q,/3 depend on hap. 
However, the equation for can be written in the simple form 

9,^2 = ~a^^dx~aa0 
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(28) 



where CaP'yS is the Weyl tensor. We have used the properties k'^V ak^ = 0, daisk^ = 
and the relations cr^a^p = hapa"^ and o'^'^dxhap = 0. Equation (28) is not closed; as in 
the case of the dust geodesies, the shear equation of motion cannot be reduced to scalar 
form. 

For practical use, (27) should be written in terms of the observed redshift rather 
than A, and the contribution of the null shear term (28) should be evaluated. We will 
first go through this in the FRW case and then consider the clumpy situation. 

Exact homogeneity and isotropy. The symmetry of a FRW universe implies that (in 
the appropriate coordinate system) the diagonal components of should be equal, and 
the off-diagonal components should vanish, since 5""^ is a spatial tensor. This implies 
that o""^ vanishes, since it is traceless. However, the shape of the source can break the 
symmetry, generating non-zero shear. Since the Weyl tensor vanishes in FRW models, 
(28) gives 5"^ oc cx D^. So 5"^ can only decrease along the geodesic, and if it is zero 
initially, it will remain zero. We can thus neglect the null shear. 

Because of the symmetry, the distance is independent of spatial position, so 
dxDA = EdtDA = -E{1 + z)Hd,DA. Since E oc 1 + z and p oc oc (1 + zf, 
(27) reduces to 



Given the initial conditions -Da(O) = 0,9^Dyi(0) = Hq^, the distance Da is completely 
determined by the evolution history H{z) and the present value of the matter density 
Pq. The spatial curvature enters only via it effect on H. For general matter content, p 
on the right-hand side of (29) would be replaced hj p + p, where p is the pressure. 

Instead of using the Einstein equation to substitute the energy-momentum tensor 
for Rap in (26), we can express Ra/s directly in terms of H and the spatial curvature 
^^^R = 6K{1 + zY- Essentially, we are swapping p + p for the spatial curvature. This 
makes it possible to integrate (29) in a closed form, regardless of the matter content 
or the theory of gravity: the result only depends on the metric having the FRW form. 
Substituting 47rGN(p + p) = —H + K{1 + zY on the left-hand side of (29) and making 
the change of variable v = dz' /H{z'), we obtain 



Hd,[{l + zfHd,DA] 



AttG^pDa 

A'KGnpo{l + zfDA . 



(29) 




(30) 




(31) 
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The fact that the spatial curvature evolves like (1 + 2)^ in all FRW universes is the 
reason why Da can be written in this universal form, which depends only on H and K. 
However, we may equally view Da as being determined by H and p + p- While these 
quantities are related to the spatial curvature in a simple manner in the homogeneous 
and isotropic case, the situation is different in a clumpy space. 

In [86], the universal FRW relation (31) was formulated as a consistency condition 
between Da and H. (A similar consistency condition, but specific to the ACDM model, 
was presented in [87].) If the relation (31) between Da and H is violated, the metric 
cannot be the FRW one. This is a null test: if the condition is not violated, we 
cannot conclude that the metric has the FRW form, especially as recovering H from the 
observations requires assumptions about the geometry, with the FRW metric usually 
adopted to begin with. 

Statistical homogeneity and isotropy. In a general dust spacetime, the first obstruction 
to finding Da{z) is that no such function exists. While the angular diameter distance 
is a monotonic function of A, the redshift is not. As (18) shows, the redshift can both 
increase and decrease along the null geodesic. Physically this is clear: in a region which 
is collapsing (or strongly negatively sheared in the direction along the null geodesic), 
the light gains a blueshift, i.e. a negative redshift. There is a unique redshift at each 
point along the null geodesic, but more than one point may share the same redshift. 
Therefore, while the function -Dyi(A) exists (and is monotonic), the expression Da{z) is 
not single-valued, and the same redshift may correspond to several distances. This is 
true even for a single geodesic, variation in different directions on the sky aside (see [88] 
for an example). 

In a statistically homogeneous and isotropic universe which expands on average, the 
redshift accumulated over a section of the null geodesic passing through a homogeneity- 
scale sized region is always positive. So while Da{z) does not exist, there is a function 
Da{{z)). As before, by statistical homogeneity and isotropy, (z) can be interpreted as 
either the spatial average or the average over a section of the null geodesic longer than 
the homogeneity scale, but much shorter than the Hubble scale. 

In the case of the redshift, it was possible to directly integrate equation (18) in 
terms of the variable A, so we could straightforwardly discuss the smoothing in terms 
of average quantities related to the dust geometry. For the area expansion rate 9, 
or equivalently the angular diameter distance Da, we cannot write down the solution 
explicitly^. Nevertheless, we can still follow a similar line of reasoning than with the 
redshift, but consider smoothing at the level of the equation instead of its solution. The 
objective is to see whether it follows from statistical homogeneity and isotropy that the 
mean of the area expansion rate 6 dominates over the variation, |A6'| ^ |(^)| (and if 
not, what additional assumptions are needed), and find the equation relating (6) to the 

% While the equation (26) for 9 looks simple, it is a sub-case of the Riccati equation which does not 
have a general solution. Switching to Da gives (27), the one-dimensional Schrodinger equation, for 
which there is no general solution either. 
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average dust geometry. 

The source term /i^ in the equation (26) for 6 has large local variations, just like 
the right-hand side of the redshift equation (18). In fact, the variation is stronger 
than in the redshift case, since the matter density can change by orders of magnitude, 
|Ap| ^ (p). However, analogously to the redshift case, 9 can depend on /i^ only via the 
integral / dA/i^ (and its further integrals). (This is transparent with the substitution 
^ = fW ^ 2 / dXfi^ into (26).) We can write this quantity as 

J dA/i^ = J dA(47rGNP^^ + a^) 
= I dt^{ATiG^pE^ + a^) 

J A 

^ ^dt^[4vrGN((p) + Ap)(i?)2 + (a2)+Aa2] , (32) 

where we have again assumed that vorticity vanishes so that we can parametrise points 
along the geodesic uniquely with t, and we have taken into account |Ai?| <C {E). 

For the matter density, we can use the same reasoning as with the expansion rate 
in (21) to argue that the contribution of Ap to (32) is subdominant to the contribution 
of (p) over sufficiently long distances. (Via (6), the matter density is related to the 
expansion rate by p{t,x) = p{tQ,x)e" ^^o^^^ .) In fact, the argument is now stronger, 
because fluctuations in the density necessarily cancel due to conservation of mass. 
In [89,90], the line average of the density in a Swiss cheese model was found to be smaller 
than the volume average. However, the structures in the model are not distributed in a 
statistically homogeneous and isotropic manner (and they are also unrealistically large). 
Randomising the structures leads to a suppression of the deviation from the FRW result 
for the distance [90-92], and we expect this to hold also for the density. 

There is one additional complication compared to the redshift case, namely 
correlation between Ap and Ai?. We have neglected AE because it is small compared to 

and Ap because its mean vanishes. However, Ai? and Ap are correlated (the density 
is anti-correlated with the expansion rate, which is correlated with the redshift), so the 
contribution of the term 87rGNApAi?(i?) in (32) does not vanish over long distances, 
and |Ap|/(p) can be locally orders of magnitude higher than unity. However, highly 
overdense regions can take up only a small fraction of the volume, because mass is 
conserved. In particular, since |Ap|/ (p) in underdense regions cannot exceed unity, the 
typical value in overdense regions is also at most unity, and the overall mean amplitude 
of |Ap|/(p) cannot compensate for the suppression factor \ AE\/{E) ~ 10~^. 

For the null geodesic shear, there is no such simple argument. Integrating (28), we 

have 

^^(A) =a^(Ao)e-^4^^'2^'-e-^^>^'^^' [\x'e<''"'' k^k^a^^C^^.s ■ (33) 

The first term vanishes if the initial shear is zero, and anyway decreases along the null 
geodesic, as in the FRW case. The second term is less straightforward. Using the 
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decomposition (17), the Weyl term in the integrand can be written as 

= 2E\E^^ + ~e^^^^HP\)d^p , (34) 

where we have decomposed the Weyl tensor in terms of its electric and magnetic 
components, defined as Eap = Ca-tpsu^u^ , Hap = \^a^C^si3^u^ , and we have denoted 
the volume element on the two-space orthogonal to both and e" as lap = ^ap-^e^ . 
The tensors Eap and Hap are traceless, symmetric, and orthogonal to Writing the 
null shear in terms of e° and quantities related to the dust geometry using (3) and (17) 
and adopting the two- metric (23), we have 

= 2E''E^f [hfhp' + ^^e^epW"^ {a^s + V(^e,)) 

+ 2E^mPh^~ep\a^s + V(^e5)) . (35) 

We could argue that if there is no preferred direction on the two-space orthogonal 
to M° and 6°^, the diagonal components of Hap, as well as + V {a^p), in the directions 
parallel to that space should contribute equally when integrated over long distances. 
Then the contribution of the magnetic Weyl term on the second line of (35) to the 
integral (33) would vanish. A similar argument for Eap would leave a product of the 
off-diagonal terms of Eap and aap + ^ [a^p), and it is not clear why its integral would 
vanish. We could try to formulate an argument along the lines that the contribution 
of the term involving V(ae^) should vanish in a statistically homogeneous and isotropic 
space, since the result should not depend on e" (though this is not obvious), but this 
would still leave the dust shear. There is no clear symmetry reason for E^p and the 
dust shear a^p, or in terms of (34), the null shear dap, to be uncorrected, especially 
as the Weyl tensor acts as a source for the null shear. We cannot get rid of all terms 
not directly related to the average dust geometry with symmetry arguments leading to 
a lack of correlation over long distances, unlike in the case of the redshift integral (20). 

However, if the amplitude of the Weyl tensor is highly suppressed compared to 
the Ricci tensor (specifically, to ^TiG^{p)), the null shear can be neglected in (32), 
regardless of the correlations of the Weyl tensor. In general, the components of the 
Weyl tensor are locally not smaller than the matter density. (In particular, in vacuum 
regions the Ricci tensor is zero, and the curvature is manifested entirely via the Weyl 
tensor; see [93] for an example in a Swiss cheese model.) However, it is possible that 
in a statistically homogeneous and isotropic space the contribution of the Weyl tensor 
to scalar observables is small compared to the contribution of the Ricci tensor, when 
integrated over scales larger than the homogeneity scale. 

From observations, the null shear is known to be small in the real universe [68]. 
The smallness of the shear is theoretically supported by studies of various models which 
find only small effects on the distance when the expansion rate is the FRW one (see [21] 
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for an overview and references). Nevertheless, since we do not have a general theoretical 
argument for the smallness of the Weyl contribution, we will retain in the equations. 
Even though the Weyl tensor (and its relative contribution to /i^) can vary strongly 
between different regions, we assume that since the Weyl tensor affects 6 only via a 
double integral, the contribution of the variation Aa^ is small compared to the mean 
value (5"^), when integrated over long distances. 

In addition to J dXfi^, the area expansion rate 6 can explicitly depend on the affine 
parameter A. So we should also divide A into the mean and the variation. We have 
/ dA = ^ ^ I ~ Ix '^^■^p' ^g^iii assuming that vorticity vanishes so that we can 
parametrise the null geodesic with t, and taking into account \AE\ <^ (E). This gives 
(A) and AA, and shows that the variation of A on hypersurfaces of constant proper time 
is small, |AA| <C (A). In the first term, we have dropped the subscript A to indicate that 
it is independent of the specific geodesic. We can write"*" (with some abuse of notation) 

el^xj dA/i^) ^^~((A) + AA,| + ^ (36) 

where we have taken into account \AE\ <^ (E). The correction terms in both arguments 
of 6 are small compared to the mean, so if we expand 6' in a Taylor series around the 
average values of the arguments, the next order terms are suppressed and we have the 
result |A^^| ^ |(^)|- (It is crucial that 6 depends on /i^ only via an integral: otherwise 
the result would not hold, since |A/i^| ^ (/U^)-) 

We cannot simply substitute 6 = (6) + A6 into (26) and drop all terms involving 
A9, because, as in the redshift case, we do not in general have |(9a(A6')| -C \dx{0)\. 
In fact, |9a(A^)| ~ |A/x^| ^ (/x^) ~ \d\{9)\. However, we can integrate (26) once to 
get an equation where J dX6^ appears instead of d\6. All terms involving A^ are then 
subdominant, and can be dropped. The resulting equation depends only on the time t, 
not on the spatial coordinates. Taking a time derivative, we obtain 

{E)m + l{0)'= -2(/i2) 

= - 87rGN(p^') - 2(^2) 

^ -87rGM{E)'-2{a') , (37) 

where we have taken into account \AE\ <^ (E). This is the equation that we would have 
gotten by naively replacing all quantities by their averages in (26), putting d\ {E)dt, 
and neglecting variance and the non- commutation of taking the derivative and averaging. 

We now have all the ingredients for the equation for the angular diameter distance 
in terms of the average redshift. From the relation (25) we get, using similar reasoning 
as above, {Da) oc exp ^| J -^{9)^ and |AD^| ^ {Da)- Inverting, we have (6*) = 

2{E)dt{DA)/{DA). Recall that E oc {1 + z) ^ 1 + {z) = a-\ and H = d/a. From (12) 
we have {p) oc oc (1 + {z))^. Putting the pieces together, we obtain 

[(1 + {z))'Hd^,}{DA)] = - [AnGM + {E)~'{a')] {Da) 

= - [47rGN(po)(l + {z)f + {E)^\a')] {Da) , (38) 

"•" We omit possible dependence on further integrals of fP. 
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where 



{E)-'{&') = 2(1 + {z)r'{D^)-' I ^(1 + z'){Da)' X 



{a^s + W{^es)) ) , (39) 



1 

— ( 
2 

where we have assumed that the initial shear is small and can be neglected, and we have 
taken into account |AD^| <^ (-Da). 

The equation (38) is our main result for the distance. It shows that the average 
angular diameter distance can be written in terms of the average dust geometry, plus the 
null shear. Aside from the null shear term, (38) has the same form as the corresponding 
FRW equation (29) in the case when p + p oc (1 + 2)^, i.e. when the matter consists 
of dust and vacuum energy. When the null shear is negligible, differences between the 
distances of a clumpy model and the ACDM FRW model are completely encoded in the 
expansion rate and the redshift. A clumpy model with the same H{{z)) (and present-day 
matter density) as the ACDM model has the same average distance-redshift relation, 
even though the spatial curvature will in general evolve quite differently, as discussed 
in section 2.2. (This conclusion disagrees with [60,61,94], where the distance was 
considered using ad hoc treatments for the spatial curvature; see also [55].) This is in 
contrast to FRW models with matter other than dust plus vacuum energy, where the 
distance deviates from the ACDM case because of a different p + p in addition to a 
different H. Of course, this depends on writing the equation for the angular diameter 
distance in terms of H and p + p instead of H and the spatial curvature. In the FRW 
case, it is possible to eliminate p + p in favour of the spatial curvature. Let us see why 
this does not work in a clumpy universe. 

Analogously to the FRW case, we can use (10) and (11) to substitute AtiG^{p) = 
—H + {^^^R)/6 + Q/2 on the left-hand side of (38) and make the change of variable 



V 



/J'^ dz'/H{z') to obtain 



d^D = - + + {Er^a')^ (1 + {z))-'D , (40) 

where we have denoted D = (1 + {z)){Da)- We could express Q in terms of {^^^R) 
using the integrability condition (14). However, there is a simple integral (31) in terms 
of V only when the right-hand side of (40) is independent of (z), which requires the 
expression inside the parentheses to be proportional to (1 + {z)Y, such as when Q = 
and (5-2) = 0. (When Q = 0, it follows from (14) that {^^^R) oc a^^.) 



4. Discussion 



4-1- Observational and modelling issues 

Spatial curvature and the position of the CMB peaks. The position of the CMB acoustic 
peaks is often considered to be a measurement of spatial curvature. The reason is 
that the peak location in multipole space corresponds to the apparent size of the last 
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scattering sound horizon, which provides a measure of the angular diameter distance 
to the last scattering surface ai z ^ 1100 [95] (page 99). The observed position of the 
peaks is consistent with spatial flatness in a ACDM universe [23] . 

However, the peak position does not imply spatial flatness, even in a FRW universe. 
It is clear from the way H and K enter the distance (31) that for any value of K, it is 
possible to adjust H to compensate for the spatial curvature so as to keep the distance 
fixed. For example, the peak position is consistent with a FRW universe with large 
positive spatial curvature [96] . Such a model is not viable due to other constraints, such 
as the value of the Hubble parameter today. By replacing the vacuum energy with exotic 
matter with a time-dependent equation of state, it is possible to do a similar adjustment 
and allow spatial curvature without changing the expansion history as radically [97,98]. 

In the FRW case, it is true that given the expansion history H{z), the peak position 
provides a measurement of the spatial curvature. The root of this argument is the 
relation (31), which expresses the distance in terms of the expansion rate and the 
spatial curvature. In a clumpy universe the distance is instead completely fixed by the 
expansion rate and the matter density (as well as the null shear), according to (38)*. 
As (40) shows, the expression (31) is inapplicable due to the non-trivial evolution of the 
spatial curvature as well as the fact that clumping contributes to the expansion rate via 
Q. In terms of the density parameters defined by dividing the expansion law (11) by 
3if ^, the density of matter and the density of curvature do not sum to unity, because of 
the contribution of Q [6,99]. In a FRW model, any additional contribution that changes 
H enters the distance also via p + p (which is related to the spatial curvature), but that 
is not the case here. In the FRW case with arbitrary matter, the spatial curvature is 
always proportional to (1 + z)"^, and the evolution of p + p is not fixed, while in a clumpy 
dust universe, p + p is always proportional to (1 + -z)^, but the evolution of the spatial 
curvature is complicated. 

It is somewhat trivial that the spatial curvature enters the distance only via its 
effect on the expansion rate, since the spatial curvature can be written in terms of H, 
H and (p) oc (1 + (z))'^ using the Buchert equations (10) and (11). However, it is not 
obvious that the equation for the average distance (38) depends only on H and the 
matter density, or that the dependence on H is the same as in the FRW case, so that 
we recover the ACDM equation (because p + p is the same in both cases). 

In summary, the CMB peak position can be consistent with large spatial curvature 
in a clumpy model. All that is required is that the expansion history and the matter 
density today is sufficiently close to that of the spatially fiat ACDM model. (Since the 
Hubble rate enters via an integral, significant variation in H{{z)) is still allowed [100].) 

The effective equations of state. Observations of distances are typically analysed in 
terms of an effective equation of state in a FRW model. (Regarding the dependence of 

* Note that we are comparing a clumpy universe with only dust to a FRW universe with arbitrary 
matter content. If we allowed other matter in the clumpy situation, the distance would also depend on 
the non-trivial (p) + (p). In this case, the Buchert equations would also be more complicated [7]. 
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the results on the adopted parametrisation for the equation of state, see [100].) The 
equation of state determines the evolution history H{z), which then gives the distance 
Da{z) via (31). If we want to express the evolution of a clumpy dust model this way, 
there are two different effective equations of state, because the relation between H and 
(Da) is different than in FRW models. 

For the expansion rate, we can define an effective equation of state wh{z) such 
that a FRW model with this equation of state would have the same expansion history 
as the clumpy model. Formally this is done by writing the spatial curvature and 
the backreaction variable Q as a single component in the Buchert equations (10)- 
(12) [7, 101]. For comparison to distance observations, we should introduce another 
equation of state wc^z), defined so that the resulting H reproduces the real distance 
function (Da) when plugged into the FRW distance formula (31). In both cases, we 
have to make a choice for the spatial curvature of the FRW fitting model. The simplest 
choice, which does not involve any loss of generality, is to take the FRW model to be 
spatially fiat. In general, wh wd, so the effect of dumpiness cannot be treated just 
as an effective source in the FRW equations. This limits the usefulness of an effective 
description of backreaction in terms of a scalar field [101] (also, wh can violate the null 
energy condition, unlike the equation of state of a scalar field [21]). 

Until calculations of the impact of structure formation are accurate to more than 
an order of magnitude [21,22], it is not known how large the expected difference between 
Wh and wd is. However, even without a theoretical prediction, it is possible to test the 
null hypothesis that the equations of state inferred separately from observations of the 
expansion rate and distance do not show any difference. This is the essence of the FRW 
consistency check proposed in [86]. 

At the moment, while distances are measured relatively well, there are few 
observations probing the expansion rate as a function of redshift independent of the 
distance scale. The ages of passively evolving galaxies provide an interesting way to 
measure the expansion history, but at the moment the constraints are rather weak [27]^. 
Another measure is provided by baryon acoustic oscillations, which are sensitive to a 
combination of expansion rate and distance [32] . (This was used to constrain local void 
models in [41].) There does appear to be some discrepancy between the observations 
of the luminosity distance of type la supernovae and measurements of baryon acoustic 
oscillations, but only at the 2a level [102], so it is not statistically significant. When 
analysing the data in the context of FRW models, this discrepancy would be interpreted 
as a violation of the reciprocity relation = (1 + z^Da instead of the FRW relation 
(31) between H and Da- 

The expansion rate also has a role in the ISW effect, i.e. deviations of the redshift 
from the mean value (z) in different directions, as well as in the growth of density 
perturbations [30]. The ISW signal is slightly higher than the spatially fiat ACDM 

In a space which is not statistically homogeneous and isotropic, this data does not measure purely 
the expansion rate, since the shear also contributes to the redshift (20). This was used to constrain 
local void models in [39]. 



Light propagation in statistically homogeneous and isotropic dust universes 22 

prediction, but the difference is not statistically significant (it is evaluated as 2a in [28] 
and la in [29]). Neither the ISW signal nor the growth factor can be used at present to 
put accurate constraints on if as a function of redshift. 

Average observables. The equation (38) determines the average angular diameter 
distance as a function of the average redshift, given the average expansion rate. The 
averages are taken on the hypersurface of constant proper time. However, we observe 
the redshift and the distance only in one fixed location. Nevertheless, for practical 
purposes, the averages (Da) and (z) do correspond to directly observable quantities. 

In order to model what could be observed in principle, we would have to know the 
details of the structures along each line of sight to calculate the relation between and 
z for each direction. As noted earlier, the distance-redshift relation cannot in general be 
expressed as a function Da{z), because several values of can correspond to the same 
redshift. In practice differences between redshifts corresponding to the same distance 
are likely to be smaller than the observational resolution, except for nearby sources (or 
the CMB, for which the redshift is very accurately measured), because the regions which 
are collapsing (or have strong negative shear in the direction of the null geodesic) are 
small. Using the redshift integral (20), a naive estimate of the blueshift due to a region 
one Mpc across which is collapsing with a rate of the present-day Hubble parameter is 
10~^. As long as the variations in Da and z are smaller than the observational errors, we 
can safely say that the averages correspond to the observed quantities. Observationally, 
the variation of the CMB peak position with direction is known to be small [103] . Note 
that standard CMB analysis also predicts only an ensemble average, and that in practice 
cosmological observations are often analysed using an average over the full sky, which 
by statistical homogeneity and isotropy corresponds to the average over the spatial 
hypersurface. 

The Dyer-Roeder approximation. An approach where the effect of clumping on light 
propagation is modeled assuming that the light rays encounter only a fraction of the 
mass in the universe was introduced by Zel'dovich [104] and is known as the Dyer-Roeder 
approximation [105]. In this prescription, the FRW equation (29) for the distance is 
modified by multiplying the matter density by a constant a, which varies between 
and 1, corresponding to a universe where the lines of sight are completely empty or 
completely filled with matter, respectively. The smoothness parameter a was generalised 
to a function of redshift in [106] to account for the evolution of structures, and change 
of the expansion rate due to clumping was added to the equation in [107]. 

According to our result (38), clumping is irrelevant for the contribution of the 
matter density, which is always proportional to (1 + {z)Y, with no extra prefactor. The 
reason is that mass is conserved, so if the line of sight goes through an underdense 
region somewhere, it must correspondingly go through overdensities elsewhere when 
considering distances of the homogeneity scale or larger, as discussed in connection with 
equations (21) and (32). The Dyer-Roeder parameter a is always unity, and clumping 
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enters instead by changing the expansion rate. In addition, there is the null shear 
term; if interpreted as an effective, redshift-dependent a, it would correspond to a > 1, 
contrary to the Dyer-Roeder case. In summary, the Dyer-Roeder prescription does not 
correctly describe the effect of clumping. 

One possible caveat is that we have not taken into account the possibility that 
the observer or the sources could be in a special location, or that observations could 
be made along special lines of sight. For example, we would expect supernovae to 
be preferentially located in very overdense and thus highly untypical regions. The 
possibility that observations might be biased towards empty lines of sight has been 
brought up in [107,108]. 

The effect of the location of the observer is likely to be small, because the deviation 
from the mean is significant only over regions which are small compared to the overall 
distance travelled by the light, and the amplitude of the deviation is typically not 
correspondingly large (except in very special locations, such as near a black hole). This 
is another way of saying that the homogeneity scale is small. For the same reason, the 
location of the sources is not expected to have a large impact, though there could be 
a secular effect, as the degree to which the source locations are untypical could evolve 
with redshift. In any case, cosmological observations rely on various different sources, 
not only supernovae. Since different observations roughly agree, these kind of selection 
effects must be subdominant. This is also an argument against large effects due to 
special lines of sight used in observations. In particular, the CMB covers the full sky, so 
it is not subject to this kind of bias (apart from some uncertainty in the direction of the 
Galactic plane). Furthermore, since most cosmological observations (apart from nearby 
objects) are made over scales much larger than the homogeneity scale, the variation in 
the density along lines of sight should be small, and empty lines of sight should be very 
rare. Note that a clear line of sight is not necessarily empty, because most of the dust is 
dark matter, which is transparent. (See [102] and references therein for more on cosmic 
transparency. ) 

4-2. Conclusion 

Summary. It was conjectured in [21] that light propagation in a statistically 
homogeneous and isotropic dust universe which may contain non-linear structures can 
be expressed in terms of average geometrical quantities (namely the expansion rate and 
the spatial curvature), assuming that the observer is not in a special location, and that 
structures have realistically small sizes. As reviewed in [21], the literature on light 
propagation is mostly in agreement with this statement, but there had been no proof 
thus far. 

We have now derived the equation for the angular diameter distance, assuming that 
the dust universe is statistically homogeneous and isotropic as well as rotationless, and 
that structures are small and their distribution evolves slowly compared to the time it 
takes for light to cross the homogeneity scale. It follows from these assumptions that 
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the distance can be expressed in terms of the average dust geometry, apart from a term 
related to the null geodesic shear. Of the average geometry, the only term that is required 
is the average expansion rate, along with the present value of the matter density. In 
particular, the spatial curvature enters only via its effect on the expansion rate, unlike in 
Friedmann-Robertson- Walker (FRW) models with arbitrary matter content. Therefore, 
significant spatial curvature is not necessarily inconsistent with the position of the cosmic 
microwave background acoustic peaks. If the expansion history and present-day matter 
density of a clumpy model is close to that of the spatially fiat ACDM FRW model, the 
peak position will also be near the ACDM case, and thus consistent with observations. 
This is important, because accelerated expansion due to structure formation involves 
large negative spatial curvature [9,21,56]. The result also shows that the clumping is 
not correctly described by the Dyer-Roeder approximation, which changes the evolution 
of the matter density (which is in fact fixed) and misses the change of the expansion 
rate. 

To complete the proof that the redshift and the distance can be expressed in terms 
of the average geometry alone, it would be necessary to show that the contribution of 
the null shear can be neglected. Observationally, the shear is known to be small [68], 
but this should be shown to follow from statistical homogeneity and isotropy (or, if this 
is not the case, the required additional assumptions should be identified). 

Since the distance-redshift relation is determined by the average expansion rate, we 
should calculate the average expansion rate in a realistic model to evaluate the effect of 
structure formation on cosmological observations. Using the Buchert equations [6], this 
backreaction can be determined from purely statistical quantities (the variance of the 
expansion rate and the average dust shear). Evaluating these quantities in a realistic 
model is a challenging task, especially as the backreaction is a general relativistic effect 
related to spatial curvature and has no counterpart in Newtonian gravity [5,6, 10,21]t|. 
A first step was taken in [21,22], where the average expansion rate was calculated in 
a semi-realistic model with an evolving distribution of structures. To compare with 
observations of distance and the expansion rate in detail, a more rigorous treatment is 
needed, with well-quantified errors. After that, the calculation of fluctuations around 
the average should follow, in order to evaluate the Integrated Sachs- Wolfe effect and the 
growth of density perturbations. 

Before detailed analytical results on the effect of structure formation are worked 
out, it is possible to make general tests. The relation between the expansion rate and 
the distance scale in clumpy models is different than in FRW universes. Therefore, 
observations which probe the expansion rate and the distance separately can be used 
to test the null hypothesis that the two are related by the FRW consistency condition, 
as proposed in [86]. Comparing observations of type la supernovae, baryon acoustic 
oscillations and the ages of passively evolving galaxies seems promising in this respect. 

ft Regarding the differences between Newtonian gravity and tlie weak field fimit of general relativity, 
see [47,109-111]. 
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